Enriching WordNet with Folk Knowledge and Stereotypes

نویسندگان

  • Tony Veale
  • Yanfen Hao
چکیده

The knowledge that is needed to understand everyday language is not necessarily the knowledge one finds in an encyclopedia or dictionary. Much of this is “folk” knowledge, based on stereotypes and culturally-inherited associations that do not hold in all situations, or which may, strictly speaking, be false. We can open a linguistic window onto this knowledge through simile, since explicit similes make use of highly evocative and inference-rich concepts to ground comparisons and make the unfamiliar seem familiar. This paper describes a means of enriching WordNet with commonly ascribed cultural properties by mining explicit similes of the form "as ADJ as a NOUN" from the internet. We also show how these properties can be leveraged, through further web search, into rich frame structures for the most evocative WordNet concepts.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combining ConceptNet and WordNet for Word Sense Disambiguation

Knowledge-based Word sense Disambiguation (WSD) methods heavily depend on knowledge. Therefore enriching knowledge is one of the most important issues in WSD. This paper proposes a novel idea of combining WordNet and ConceptNet for WSD. First, we present a novel method to automatically disambiguate the concepts in ConceptNet; and then we enrich WordNet with large amounts of semantic relations f...

متن کامل

Towards Universal Multilingual Knowledge Bases

Lexical, ontological, as well as encyclopedic knowledge is increasingly being encoded in machine-readable form. This paper deals with knowledge representation in multilingual settings. It begins by proposing a generic graph-based knowledge base framework, and then, in three case studies, explains how preexisting knowledge can be cast into this framework. The first case study involves enriching ...

متن کامل

Automatic Assignment of Wikipedia Encyclopedic Entries to WordNet Synsets

We describe an approach taken for automatically associating entries from an on-line encyclopedia with concepts in an ontology or a lexical semantic network. It has been tested with the Simple English Wikipedia and WordNet, although it can be used with other resources. The accuracy in disambiguating the sense of the encyclopedia entries reaches 91.11% (83.89% for polysemous words). It will be ap...

متن کامل

Demo: Enriching Text with RDF/OWL Encoded Senses

This demo paper describes an extension of the Enrycher text enhancement system, which annotates words in context, from a text fragment, with RDF/OWL encoded senses from WordNet and OpenCyc. The extension is based on a general purpose disambiguation algorithm which takes advantage of the structure and/or content of knowledge resources, reaching state-of-the-art performance when compared to other...

متن کامل

Mapping WordNet synsets to Wikipedia articles

Lexical knowledge bases (LKBs), such as WordNet, have been shown to be useful for a range of language processing tasks. Extending these resources is an expensive and time-consuming process. This paper describes an approach to address this problem by automatically generating a mapping from WordNet synsets to Wikipedia articles. A sample of synsets has been manually annotated with article matches...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007